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Abstract 

Setting up the infrastructure to manage a software 
project can become a task as significant writing the soft- 
ware itself. A variety of useful open source tools are 
available, such as Web-based viewers for version control 
systems, "wikis" for collaborative discussions and bug- 
tracking systems, but their use in high-energy physics, out- 
side large collaborations, is insubstantial. Understandably, 
physicists would rather do physics than configure project 
management tools. 

We introduce the CEDAR HepForge system, which pro- 
vides a lightweight development environment for HEP soft- 
ware. Services available as part of HepForge include the 
above-mentioned tools as well as mailing lists, shell ac- 
counts, archiving of releases and low-maintenance Web 
space. HepForge also exists to promote best-practice 
software development methods and to provide a central 
repository for re-usable HEP software and phenomenology 
codes. 



INTRODUCTION 

In high-energy physics, software development is becom- 
ing a discipline in its own right. The community has ben- 
efited from the explosion in popularity of the open source 
software (OSS) paradigm and the many utilities developed 
in that spirit, with experimental collaborations beginning 
to deploy systems developed to aid management of dis- 
tributed OSS development teams. Such systems include 
Web-based bug tracking software, a variety of version con- 
trol systems and a plethora of "wiki" implementations. Var- 
ious collaborations have deployed these tools to varying ex- 
tents. 

However, away from the large collaborations, few small 
research groups have the resources to investigate, configure 
and test such tools, which tend to require expertise in areas 
such as server configuration which are outside the remit of 
most physicists. This need for a substantial initial time- 
investment, combined with a reluctance to introduce unfa- 
miliar new working methods, means that uptake of such 
systems is sparse among small groups. 

CEDAR [1,2] is just such a small collaboration. In 
developing our own software development environment 
for the JetWeb and HepData systems, it became evident 
that such an environment could be extended to set up 
well-integrated development tools for an arbitrary number 
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of similar projects with little replication cost. This sys- 
tem is called HepForge [3], so-named because it provides 
the same facilities for particle physicists that the Source- 
Forge. net [4] service provides for general open source 
projects. Additionally, HepForge provides a convenient 
forum for central archiving of various re-usable HEP pro- 
grams. 



INTRODUCING HEPFORGE 

HepForge is intended for use by small-to-medium sized 
HEP projects, specifically those with the intention to make 
their code re-usable, portable and documented. Very spe- 
cialist code, such as reconstruction software written within 
experimental frameworks, is most likely unsuitable and 
should be maintained instead within the experiment's code 
management system. Analysis routines, too, are typically 
unsuitable for HepForge: a system specifically designed 
for them is the PhyStat [5] repository. Project types suited 
to HepForge would be, for example, jet clustering algo- 
rithms, parton density function (PDF) codes and Monte 
Carlo event generators, and indeed projects currently us- 
ing HepForge's facilities include the KtJet library [28], 
LHAPDF [25] and a variety of event generators and related 
systems. Also suitable, but as yet unrepresented, would 
be e.g. statistics libraries and matrix element calculation 
codes. 

Aside from the project management facilities provided 
by HepForge, we hope to encourage the uptake of more 
standard configuration and build systems within high- 
energy physics. An excellent example of this is the 
GNU "autotools", comprising autoconf, automake and 
libtool, which automate many portability issues in code 
compilation and library management. These tools are ubiq- 
uitous in the open source world, and the familiarity of their 
build procedure is a major boon to developers on OSS 
projects, but they have yet to be widely embraced as part 
of the HEP software process. We hope that through Hep- 
Forge more projects can be encouraged and helped in ap- 
plying such tools to HEP purposes, if for no other reason 
than that such standardisation helps to integrate small tools 
into larger systems, encouraging code re-use, a traditional 
weak area in physics code. 

Our intentions explained, we now move on to describe 
the features offered to suitable projects by HepForge. 



HEPFORGE FACILITIES 

The main design requirement of HepForge is that the ini- 
tial learning curve for users should be very shallow. This 
includes not forcing a particular working pattern on the 
users: they may use as many or as few facilities as they 
wish, although the benefits of using multiple features are 
substantial. 

The primary feature of any system for managing soft- 
ware development should be the facilities provided for 
managing the code itself. HepForge provides several such 
features, of which the first is the provision of the Subver- 
sion [6] and CVS [7] version control systems. Subversion 
is a modern replacement for the well-known CVS system 
and is highly recommended as it solves many of CVS's 
known problems. Both Subversion and CVS can be ac- 
cessed via an anonymous read-only method and a read- 
write developer mode over SSH, as well as through Web- 
based viewers. Full-featured shell accounts can be pro- 
vided for developers on request and a script is provided to 
help in conversion of a CVS repository to the more mod- 
ern Subversion type. Additionally, the GNU Arch [8] and 
Dares [9] distributed version control systems are available. 
Releases of project software can be archived and are au- 
tomatically made available, sorted by version, through the 
HepForge Web interface. 

Project management and source code are linked by the 
use of the Trac [10] bug tracker software, whose primary 
interface is Web-based. Trac allows projects to define de- 
velopment milestones, with due dates, and then for bugs 
(called "tickets") to be registered against a particular mile- 
stone. The ability of users to create, modify and close tick- 
ets can be controlled by the project administrator and devel- 
opers can receive details of how bug fixes are proceeding 
by email. A defining feature of Trac is its strong integration 
with Subversion: projects using Subversion for their ver- 
sion control can automatically view the timeline of changes 
to the code and Subversion commit messages can be used 
to make automatic changes to the associated tickets. 

Trac additionally provides a "wiki" system for collabo- 
rative documentation. Again, users' ability to view, create 
and edit wiki pages can be set by the project administra- 
tor, and the wiki pages can be referenced in Trac's tick- 
ets. While documentation is usually hailed as important 
but is nonetheless neglected, Trac helps to ease this prob- 
lem by the close integration of the wiki system into the bug 
tracker and version control system. While Trac has been 
integrated smoothly into the HepForge system, users keen 
to customise Trac can do so easily, via a plain text config- 
uration file, templates for Trac's Web pages and cascading 
stylesheets (CSS) [11]. 

As well as providing facilities for managing software de- 
velopment, HepForge provides Web space for the public 
presentation of the project. The Web system is also de- 
signed for maximum convenience and is continually evolv- 
ing new features. A project named, for example, "foo" 
will automatically have a HepForge Web page located 



at http://hepforge.cedar.ac.uk/foo/ with the full 
features of normal HTML and CGI scripts, including PHP 
and the Spyce inline Python engine. The displayed name 
of the project, for example "Foo", and meta-data such as a 
project description, categories and keywords can be spec- 
ified via an XML file: eventually this functionality will 
be available through the HepForge Web interface. Project 
Web pages are processed through a set of custom Apache 
2 output filters, which can provide automatic headers and 
footers on all Web pages, will correct mistakes in HTML 
automatically, provide constants support in CSS files, hide 
email addresses from search engines and spam harvesters 
and allow the page source to be written in more relaxed 
syntaxes than HTML. Other filters in development will al- 
low easy automatic highlighting of source code, inline ren- 
dering of LaTeX equations and support for more syntaxes. 

Finally, HepForge provides mailing lists for projects: by 
default a list to allow users to contact project administrators 
and a list for announcements to users are created, but more 
are available on request. The lists can be configured by 
project administrators and list members can manage their 
own subscriptions. 

From the point of view of the HepForge maintainer, most 
system tasks are managed via a carefully designed set of 
shell scripts, which render the system maintenance mini- 
mal. Common tasks, such as addition of new projects or 
users, are highly automated. 

For small projects, HepForge offers several benefits over 
SourceForge and CERN's deployment of the related Sa- 
vannah [12] system. HepForge provides full shell accounts 
and places few restraints on what users may or may not 
do. The Web interface, within the project areas, is almost 
entirely under the project's control. Useful features such 
as Subversion support, the highly usable Trac system and 
the Web filter system are unique to HepForge. The system 
is also in active development and many additional features 
are planned. 

HEPCODE 

HepForge will eventually be used to provide a final por- 
tion of CEDAR, named HepCode [13]. This was origi- 
nally a project to provide access to well-defined versions of 
Monte Carlo generator programs, parton distribution func- 
tions and other high-energy physics calculation codes, but 
with HepForge the idea has expanded beyond phenomenol- 
ogy codes. In its current state, HepCode is simply a list 
of programs, with links to where they can be downloaded: 
eventually HepForge will be used to maintain and search 
this list and to archive released versions of the code. 

CONCLUSIONS 

HepForge is a new development environment for re- 
usable high-energy physics software. It provides a range of 
useful facilities with an emphasis on ease of use and flexi- 
bility, and avoids the setup costs that such a system would 



impose on small projects. HepForge also aims to encour- 
age the uptake of standard build systems and development 
methods in HEP software development. 

HepForge currently hosts the core CEDAR projects 
(JetWeb [14], HepML [15], HZTool [16], HZSteer [17], 
Rivet [18], RivetGun [19] and HepData [20]) as well 
as several external projects, including Herwig++ [21], 
ThePEG [22], Herwig 6 [23], Pythia 6 [24], LHAPDF [25], 
Jimmy [26], fastNLO [27], KJet [28] and RunMC [29]. To 
date user feedback has been extremely positive. Other suit- 
able projects are most welcome to join HepForge. 
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